The Expected Total Cost Criterion for Markov Decision Processes under Constraints: A Convex Analytic Approach
Similar resources
A Convex Analytic Approach to Risk-Aware Markov Decision Processes
Abstract. In classical Markov decision process (MDP) theory, we search for a policy that, say, minimizes the expected infinite horizon discounted cost. Expectation is, of course, a risk-neutral measure, which does not suffice in many applications, particularly in finance. We replace the expectation with a general risk functional, and call such models risk-aware MDP models. We consider minimization ...
Discounted Continuous Time Markov Decision Processes: the Convex Analytic Approach
The convex analytic approach, which is dual, in some sense, to dynamic programming, is useful for the investigation of multicriteria control problems. It is well known for discrete time models, and the current paper presents similar results for the continuous time case. Namely, we define and study the space of occupation measures, and apply abstract convex analysis to the study of constraine...
Continuous Time Markov Decision Processes with Expected Discounted Total Rewards
Abstract. This paper discusses continuous time Markov decision processes with the criterion of expected discounted total rewards, where the state space is countable, the reward rate function is extended real-valued, and the discount rate is a real number. Under conditions necessary for the model to be well defined, the state space is partitioned into three subsets, on which the optimal value function ...
Markov decision evolutionary games with time average expected fitness criterion
We present a class of evolutionary games involving large populations that have many pairwise interactions between randomly selected players. The fitness of a player depends not only on the actions chosen in the interaction but also on the individual state of the players. Players stay permanently in the system and participate infinitely often in local interactions with other randomly selected pl...
Necessary Conditions for Continuous Time Markov Decision Processes with Expected Discounted Total Rewards
This paper discusses a set of necessary conditions for continuous time Markov decision processes with the criterion of expected discounted total rewards, where the state space is countable, the reward rate function is extended real-valued, and the discount rate is any real number. Under conditions necessary for the model to be well defined, the state space is partitioned into three subsets, on which t...
Journal
Journal title: Advances in Applied Probability
Year: 2012
ISSN: 0001-8678,1475-6064
DOI: 10.1239/aap/1346955264